Humans, Machines, and Conversations: An Ethnographic Study of the Making of Automatic Speech Recognition Technologies

Author

Adelheid Voskuhl

Abstract

This essay investigates the design of automatic speech recognition (ASR) technologies as a site at which the human qualities of ‘hearing’ and ‘understanding’ are mimicked in machines. On the basis of an observational study, it explains the major work components and debates in ASR, such as research paradigms and strategies, as well as the intricacies of instruments and experimentation. Key arguments concern the tensions, conflicts, and ambiguities that emerge from the intersection of disparate research paradigms in ASR. The study emphasizes the actors’ success in establishing a local context of stable, practical rationality in which they negotiate the complementary and conflicting research objectives of building speech recognition machines, on one hand, and understanding human hearing, on the other. The conclusion is cast in terms of a ‘functional contingency’, a concept that characterizes this setting in which researchers successfully make machines ‘recognize speech’ and in so doing make the machines part of specific types of ‘conversation’.

Similar resources

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods

For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among these, lip-reading techniques face many challenges for speech recognition; one of the challenges b...

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

The Effect of Radio Waves on the Quality and Safety of Wearable Sensors in Healthcare

The industrial Internet of Things (IoT) is aiming to interconnect humans, machines, materials, processes and services in a network. Wireless Sensor Network (WSN) comprises the less power consuming, light weight and effective Sensor Nodes (SNs) for higher network performance. Radio Frequency Identification (RFID) and sensor networks are both wireless technologies that provide limitless future po...

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

Publication date: 2004
